AMENDMENTS TO THE SPECIFICATION 
Please amend the following numbered paragraphs in the specification to read as follows: 

[0005] U.S. Pat. No. 5,9 4 0,188 5.940.118 to Van Schyndel teaches automatic tracking 

using an optical transducer that takes visual cues (e.g., a moving mouth) to point the camera 
toward the location of the speaker. The method requires a very advanced processor, an optical 
transducer, and is subject to many false signals (e.g., one participant whispering to a neighbor). 


[0038] As an example of steps 410 and 420, FIG. 5 shows videoconferencing system 

140, two potential audio sources 510 and 515, and five microphones 520, 522, 524, $46 526 . and 
528. Once pan 530, tilt 540, and zoom 550 of camera 160 are known for each potential source 
/ 510 and 515, the distance of each potential source 510 and 515 to each microphone 520, 522, 
524, 546 526 . and 528 is determined by applying trigonometric principles. Of course, the 
coordinates of each microphone 520, 522, 524, 526, and 528 in relation to camera 160 would 
need to be either known or calculated. 


[0049] In the last step 470 a decision is made whether to continue camera tracking. 

Camera tracking can be performed only once, be performed only at certain intervals, be 
continuous, or only be performed only if certain conditions are met (e.g., when sound is 
received). Unnecessary camera movement may be avoided by waiting until a certain amount of 
time has elapsed before moving the camera to ensure a new person is speaking, thereby limiting 
erroneous tracking due to coughing or errant comments. Camera tracking can also be performed 
by using multiple cameras and only activating the camera that frames the speaker or by using a 
PTZ (pan/tilt/zoom) camera (mechanical or electronic). 
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